Influence of Data Preprocessing

نویسندگان
چکیده

منابع مشابه

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

Gene expression data preprocessing

We present an interactive web tool for preprocessing microarray gene expression data. It analyses the data, suggests the most appropriate transformations and proceeds with them after user agreement. The normal preprocessing steps include scale transformations, management of missing values, replicate handling, flat pattern filtering and pattern standardization and they are required before perfor...

متن کامل

Chapter 15: Data Preprocessing

Data preprocessing converts raw data and signals into data representation suitable for application through a sequence of operations. The objectives of data preprocessing include size reduction of the input space, smoother relationships, data normalization, noise reduction, and feature extraction. Several data preprocessing algorithms, such as data values averaging, input space reduction, and da...

متن کامل

Preprocessing of NMR metabolomics data.

Metabolomics involves the large scale analysis of metabolites and thus, provides information regarding cellular processes in a biological sample. Independently of the analytical technique used, a vast amount of data is always acquired when carrying out metabolomics studies; this results in complex datasets with large amounts of variables. This type of data requires multivariate statistical anal...

متن کامل

Influence of data preprocessing on prediction of complex valued load time series

Exploitation of modern power systems requires prediction of the electrical load time series for operation of power utilities and load estimates for market operation and system planning. Increase of energy produced from renewable sources and deregulation of electrical energy market makes load prediction more important nowadays. By its nature, electrical load time series are highly non linear and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Computing Science and Engineering

سال: 2016

ISSN: 1976-4677

DOI: 10.5626/jcse.2016.10.2.51